Survey on Data Deduplication for Cloud Storage to Reduce Fragmentation
نویسندگان
چکیده
Data Deduplication is an important technique which provides better result to store more information with less space. Cost and maintenance of Information backup storage system for major enterprises can be minimized by storing it on Cloud Storage. Data redundancy between different kinds of data storage gets minimal by utilizing data deduplication method. By giving each application differently and storing the associated information distinctly the overall disk usage can be enhanced to a great level. Cloud backup systems uses data deduplication to eliminate duplicate chunks that are present in multiple files. The duplicate chunks are substituted with the references to already present chunks through deduplication, without storing it again on cloud storage. The successive chunks are actually stored in scattered form in backup system in numerous segments (the storage unit of cloud).
منابع مشابه
Survey on Fragmentation for Deduplication in Backup Storage
In backup environments field deduplication yields major advantages. Deduplication is process of automatic elimination of duplicate data in a storage system and it is most effective technique to reduce storage costs. Deduplication effects predictably in data fragmentation, because logically continuous data is spread across many disk locations. Fragmentation mainly caused by duplicates from previ...
متن کاملIn-line Deduplication for Cloud storage to Reduce Fragmentation by using Historical Knowledge
Recovery and Backup system in which the process involves that copying and archiving of data on different cloud server, so that this data is used to recover the unique data, afterward a loss event. Purpose of backup is to recover data after its loss and to improve data from a past time. In backup systems, the fragments of every data file are physically distributed over multiple servers, which in...
متن کاملHPDedup: A Hybrid Prioritized Data Deduplication Mechanism for Primary Storage in the Cloud
Eliminating duplicate data in primary storage of clouds increases the cost-efficiency of cloud service providers as well as reduces the cost of users for using cloud services. Most existing primary deduplication techniques either use inline caching to exploit locality in primary workloads or use postprocessing deduplication running in system idle time to avoid the negative impact on I/O perform...
متن کاملA Survey On: Secure Data Deduplication on Hybrid Cloud Storage Architecture
Data deduplication is one of the most important Data compression techniques used for to removing the duplicate copies of repeating data and it is widely used in the cloud storage for the purpose of reduce the storage space and save bandwidth. To keep the confidentiality of sensitive data while supporting the deduplication, to encrypt the data before outsourcing convergent encryption technique h...
متن کاملSimilarity and Location Aware Scalable Deduplication System for Virtual Machine Storage Systems
I.INTRODUCTION In this paper with the potentially unlimited storage space offered by cloud providers, users tend to use a large amount space as they can and vendors continually look for techniques aimed to reduce redundant data and exploit space savings. A technique which has been widely adopted is crossuser deduplication. The simple idea behind deduplication is to accumulate duplicate data onl...
متن کامل